RAW SEQUENCE LISTING

PATENT APPLICATION: US/10/567,749



DATE: 02/28/2006 
TIME: 12:03:09 



Input Set : A:\pto.da.txt 

Output Set: N:\CRF4\02282006\J567749.raw

3 <110> APPLICANT: Degussa AG <120> Process for the preparation of L-threonine <130> 030235

C--> 4 <140> CURRENT APPLICATION NUMBER: US/10/567,749 
C--> 4 <141> CURRENT FILING DATE: 2006-02-10 



E--> 4 <160> NUMBER OF SEQ ID: 10 <170> PatentIn version 3.1 <210> 1 <211> 993 <212> DNA <213>



E--> 92 <210> SEQ ID NO: 2 <211> 330 <212> PRT <213> Escherichia coli <400> 2

b~> 93 Does Not Comply 

*- > 93 - , Corrected Diskette Needed 

E--> 93 <400> SEQUENCE: 

93 Met Ser Gln Asn Thr Leu Lys Val His Asp Leu Asn Glu Asp Ala Glu

94 1 5 10 15 

96 Phe Asp Glu Asn Gly Val Glu Val Phe Asp Glu Lys Ala Leu Val Glu 

97 20 25 30 

99 Gln Glu Pro Ser Asp Asn Asp Leu Ala Glu Glu Glu Leu Leu Ser Gln

100 35 40 45 



102 Gly Ala Thr Gln Arg Val Leu Asp Ala Thr Gln Leu Tyr Leu Gly Glu

103 50 55 60

105 Ile Gly Tyr Ser Pro Leu Leu Thr Ala Glu Glu Glu Val Tyr Phe Ala

106 65 70 75 80

109 Arg Arg Ala Leu Arg Gly Asp Val Ala Ser Arg Arg Arg Met Ile Glu

110 85 90 95

112 Ser Asn Leu Arg Leu Val Val Lys Ile Ala Arg Arg Tyr Gly Asn Arg

113 100 105 110

115 Gly Leu Ala Leu Leu Asp Leu Ile Glu Glu Gly Asn Leu Gly Leu Ile

116 115 120 125

118 Arg Ala Val Glu Lys Phe Asp Pro Glu Arg Gly Phe Arg Phe Ser Thr

119 130 135 140

121 Tyr Ala Thr Trp Trp Ile Arg Gln Thr Ile Glu Arg Ala Ile Met Asn

122 145 150 155 160

124 Gln Thr Arg Thr Ile Arg Leu Pro Ile His Ile Val Lys Glu Leu Asn

125 165 170 175

127 Val Tyr Leu Arg Thr Ala Arg Glu Leu Ser His Lys Leu Asp His Glu

128 180 185 190

130 Pro Ser Ala Glu Glu Ile Ala Glu Gln Leu Asp Lys Pro Val Asp Asp

131 195 200 205

133 Val Ser Arg Met Leu Arg Leu Asn Glu Arg Ile Thr Ser Val Asp Thr

134 210 215 220

136 Pro Leu Gly Gly Asp Ser Glu Lys Ala Leu Leu Asp Ile Leu Ala Asp

137 225 230 235 240

139 Glu Lys Glu Asn Gly Pro Glu Asp Thr Thr Gln Asp Asp Asp Met Lys

140 245 250 255

142 Gln Ser Ile Val Lys Trp Leu Phe Glu Leu Asn Ala Lys Gln Arg Glu

143 260 265 270

145 Val Leu Ala Arg Arg Phe Gly Leu Leu Gly Tyr Glu Ala Ala Thr Leu

146 275 280 285

148 Glu Asp Val Gly Arg Glu Ile Gly Leu Thr Arg Glu Arg Val Arg Gln

149 290 295 300

151 Ile Gln Val Glu Gly Leu Arg Arg Leu Arg Glu Ile Leu Gln Thr Gln

152 305 310 315 320

154 Gly Leu Asn Ile Glu Ala Leu Phe Arg Glu

155 325 330
E--> 157 <210> SEQ ID NO: 3 <211> 993 <212> DNA <213> Escherichia coli <220> <221>

158 <223> OTHER INFORMATION: rpoS allele
W--> 160 < 
E--> 162 - 






162 <400> SEQUENCE: 3 

163 atgagtcaga atacgctgaa agttcatgat ttaaatgaag atgcggaatt tgatgagaac 60
165 ggagttgagg tttttgacga aaaggcctta gtagaatagg aacccagtga taacgatttg 120
167 gccgaagagg aactgttatc gcagggagcc acacagcgtg tgttggacgc gactcagctt 180
169 taccttggtg agattggtta ttcaccactg ttaacggccg aagaagaagt ttattttgcg 240
171 cgtcgcgcac tgcgtggaga tgtcgcctct cgccgccgga tgatcgagag taacttgcgt 300
173 ctggtggtaa aaattgcccg ccgttatggc aatcgtggtc tggcgttgct ggaccttatc 360
175 gaagagggca acctggggct gatccgcgcg gtagagaagt ttgacccgga acgtggtttc 420
177 cgcttctcaa catacgcaac ctggtggatt cgccagacga ttgaacgggc gattatgaac 480
179 caaacccgta ctattcgttt gccgattcac atcgtaaagg agctgaacgt ttacctgcga 540
181 accgcacgtg agttgtccca taagctggac catgaaccaa gtgcggaaga gatcgcagag 600
183 caactggata agccagttga tgacgtcagc cgtatgcttc gtcttaacga gcgcattacc 660
185 tcggtagaca ccccgctggg tggtgattcc gaaaaagcgt tgctggacat cctggccgat 720
187 gaaaaagaga acggtccgga agataccacg caagatgacg atatgaagca gagcatcgtc 780
189 aaatggctgt tcgagctgaa cgccaaacag cgtgaagtgc tggcacgtcg attcggtttg 840
191 ctggggtacg aagcggcaac actggaagat gtaggtcgtg aaattggcct cacccgtgaa 900
193 cgtgttcgcc agattcaggt tgaaggcctg cgccgtttgc gcgaaatcct gcaaacgcag 960
195 gggctgaata tcgaagcgct gttccgcgag taa 993

197 <210> SEQ ID NO: 4 <211> 75 <212> DNA <213> Escherichia coli <220> <221> tRNA <222>

198 <223> OTHER INFORMATION: supE allele

200 <400> SEQUENCE: 4

201 tggggtatcg ccaagcggta aggcaccgga ttctaattcc ggcattccga ggttcgaatc 60
203 ctcgtacccc agcca 75

207 <210> SEQ ID NO: 5 <211> 1545 <212> DNA <213> Escherichia coli <220> <221> CDS

208 <223> OTHER INFORMATION: ilvA gene

E--> 210 <400> SEQUENCE: 5

211 atg gct gac tcg caa ccc ctg tcc ggt gct ccg gaa ggt gcc gaa tat 48

212 Met Ala Asp Ser Gln Pro Leu Ser Gly Ala Pro Glu Gly Ala Glu Tyr

213 1 5 10 15

215 tta aga gca gtg ctg cgc gcg ccg gtt tac gag gcg gcg cag gtt acg 96

216 Leu Arg Ala Val Leu Arg Ala Pro Val Tyr Glu Ala Ala Gln Val Thr

217 20 25 30

219 ccg cta caa aaa atg gaa aaa ctg tcg tcg cgt ctt gat aac gtc att 144

220 Pro Leu Gln Lys Met Glu Lys Leu Ser Ser Arg Leu Asp Asn Val Ile

221 35 40 45

223 ctg gtg aag cgc gaa gat cgc cag cca gtg cac agc ttt aag ctg cgc 192

224 Leu Val Lys Arg Glu Asp Arg Gln Pro Val His Ser Phe Lys Leu Arg

225 50 55 60

227 ggc gca tac gcc atg atg gcg ggc ctg acg gaa gaa cag aaa gcg cac 240

228 Gly Ala Tyr Ala Met Met Ala Gly Leu Thr Glu Glu Gln Lys Ala His

229 65 70 75 80

231 ggc gtg atc act gct tct gcg ggt aac cac gcg cag ggc gtc gcg ttt 288

232 Gly Val Ile Thr Ala Ser Ala Gly Asn His Ala Gln Gly Val Ala Phe

233 85 90 95

235 tct tct gcg cgg tta ggc gtg aag gcc ctg atc gtt atg cca acc gcc 336

236 Ser Ser Ala Arg Leu Gly Val Lys Ala Leu Ile Val Met Pro Thr Ala

237 100 105 110

239 acc gcc gac atc aaa gtc gac gcg gtg cgc ggc ttc ggc ggc gaa gtg 384

240 Thr Ala Asp Ile Lys Val Asp Ala Val Arg Gly Phe Gly Gly Glu Val

241 115 120 125

243 ctg ctc cac ggc gcg aac ttt gat gaa gcg aaa gcc aaa gcg atc gaa 432

244 Leu Leu His Gly Ala Asn Phe Asp Glu Ala Lys Ala Lys Ala Ile Glu

245 130 135 140

247 ctg tca cag cag cag ggg ttc acc tgg gtg ccg ccg ttc gac cat ccg 480

248 Leu Ser Gln Gln Gln Gly Phe Thr Trp Val Pro Pro Phe Asp His Pro

249 145 150 155 160

251 atg gtg att gcc ggg caa ggc acg ctg gcg ctg gaa ctg ctc cag cag 528

252 Met Val Ile Ala Gly Gln Gly Thr Leu Ala Leu Glu Leu Leu Gln Gln

253 165 170 175

255 gac gcc cat ctc gac cgc gta ttt gtg cca gtc ggc ggc ggc ggt ctg 576

256 Asp Ala His Leu Asp Arg Val Phe Val Pro Val Gly Gly Gly Gly Leu

257 180 185 190

259 gct gct ggc gtg gcg gtg ctg atc aaa caa ctg atg ccg caa atc aaa 624

260 Ala Ala Gly Val Ala Val Leu Ile Lys Gln Leu Met Pro Gln Ile Lys

261 195 200 205

264 gtg atc gcc gta gaa gcg gaa gac tcc gcc tgc ctg aaa gca gcg ctg 672

265 Val Ile Ala Val Glu Ala Glu Asp Ser Ala Cys Leu Lys Ala Ala Leu

266 210 215 220

268 gat gcg ggt cat ccg gtt gat ctg ccg cgc gta ggg cta ttt gct gaa 720

269 Asp Ala Gly His Pro Val Asp Leu Pro Arg Val Gly Leu Phe Ala Glu

270 225 230 235 240

272 ggc gta gcg gta aaa cgc atc ggt gac gaa acc ttc cgt tta tgc cag 768
273 Gly Val Ala Val Lys Arg Ile Gly Asp Glu Thr Phe Arg Leu Cys Gln

274 245 250 255

276 gag tat ctc gac gac atc atc acc gtc gat agc gat gcg atc tgt gcg 816

277 Glu Tyr Leu Asp Asp Ile Ile Thr Val Asp Ser Asp Ala Ile Cys Ala

278 260 265 270

280 gcg atg aag gat tta ttc gaa gat gtg cgc gcg gtg gcg gaa ccc tct 864

281 Ala Met Lys Asp Leu Phe Glu Asp Val Arg Ala Val Ala Glu Pro Ser

282 275 280 285

284 ggc gcg ctg gcg ctg gcg gga atg aaa aaa tat atc gcc ctg cac aac 912

285 Gly Ala Leu Ala Leu Ala Gly Met Lys Lys Tyr Ile Ala Leu His Asn

286 290 295 300

288 att cgc ggc gaa cgg ctg gcg cat att ctt tcc ggt gcc aac gtg aac 960

289 Ile Arg Gly Glu Arg Leu Ala His Ile Leu Ser Gly Ala Asn Val Asn

290 305 310 315 320

292 ttc cac ggc ctg cgc tac gtc tca gaa cgc tgc gaa ctg ggc gaa cag 1008

293 Phe His Gly Leu Arg Tyr Val Ser Glu Arg Cys Glu Leu Gly Glu Gln

294 325 330 335

296 cgt gaa gcg ttg ttg gcg gtg acc att ccg gaa gaa aaa ggc agc ttc 1056

297 Arg Glu Ala Leu Leu Ala Val Thr Ile Pro Glu Glu Lys Gly Ser Phe

298 340 345 350

300 ctc aaa ttc tgc caa ctg ctt ggc ggg cgt tcg gtc acc gag ttc aac 1104

301 Leu Lys Phe Cys Gln Leu Leu Gly Gly Arg Ser Val Thr Glu Phe Asn

302 355 360 365

304 tac cgt ttt gcc gat gcc aaa aac gcc tgc atc ttt gtc ggt gtg cgc 1152

305 Tyr Arg Phe Ala Asp Ala Lys Asn Ala Cys Ile Phe Val Gly Val Arg

306 370 375 380

308 ctg agc cgc ggc ctc gaa gag cgc aaa gaa att ttg cag atg ctc aac 1200

309 Leu Ser Arg Gly Leu Glu Glu Arg Lys Glu Ile Leu Gln Met Leu Asn

310 385 390 395 400

312 gac ggc ggc tac agc gtg gtt gat ctc tcc gac gac gaa atg gcg aag 1248

313 Asp Gly Gly Tyr Ser Val Val Asp Leu Ser Asp Asp Glu Met Ala Lys

314 405 410 415

316 cta cac gtg cgc tat atg gtc ggc gga cgt cca tcg cat ccg ttg cag 1296

317 Leu His Val Arg Tyr Met Val Gly Gly Arg Pro Ser His Pro Leu Gln

318 420 425 430

320 gaa cgc ctc tac agc ttc gaa ttc ccg gaa tca ccg ggc gcg ctg ctg 1344

321 Glu Arg Leu Tyr Ser Phe Glu Phe Pro Glu Ser Pro Gly Ala Leu Leu

322 435 440 445

324 cgc ttc ctc aac acg ctg ggt acg tac tgg aac att tct ttg ttc cac 1392

325 Arg Phe Leu Asn Thr Leu Gly Thr Tyr Trp Asn Ile Ser Leu Phe His

326 450 455 460

329 tat cgc agc cat ggc acc gac tac ggg cgc gta ctg gcg gcg ttc gaa 1440

330 Tyr Arg Ser His Gly Thr Asp Tyr Gly Arg Val Leu Ala Ala Phe Glu

331 465 470 475 480

333 ctt ggc gac cat gaa ccg gat ttc gaa acc cgg ctg aat gag ctg ggc 1488

334 Leu Gly Asp His Glu Pro Asp Phe Glu Thr Arg Leu Asn Glu Leu Gly

335 485 490 495

337 tac gat tgc cac gac gaa acc aat aac ccg gcg ttc agg ttc ttt ttg 1536

338 Tyr Asp Cys His Asp Glu Thr Asn Asn Pro Ala Phe Arg Phe Phe Leu

339 500 505 510

341 gcg ggt tag 1545

342 Ala Gly



E--> 345 <210> SEQ ID NO: 6 <211> 514 <212> PRT <213> Escherichia coli

E--> 346 <400> SEQUENCE: 6

347 Met Ala Asp Ser Gln Pro Leu Ser Gly Ala Pro Glu Gly Ala Glu Tyr

348 1 5 10 15

350 Leu Arg Ala Val Leu Arg Ala Pro Val Tyr Glu Ala Ala Gln Val Thr

351 20 25 30

353 Pro Leu Gln Lys Met Glu Lys Leu Ser Ser Arg Leu Asp Asn Val Ile

354 35 40 45

356 Leu Val Lys Arg Glu Asp Arg Gln Pro Val His Ser Phe Lys Leu Arg

357 50 55 60

359 Gly Ala Tyr Ala Met Met Ala Gly Leu Thr Glu Glu Gln Lys Ala His

360 65 70 75 80

362 Gly Val Ile Thr Ala Ser Ala Gly Asn His Ala Gln Gly Val Ala Phe

363 85 90 95

365 Ser Ser Ala Arg Leu Gly Val Lys Ala Leu Ile Val Met Pro Thr Ala

366 100 105 110

368 Thr Ala Asp Ile Lys Val Asp Ala Val Arg Gly Phe Gly Gly Glu Val

369 115 120 125

371 Leu Leu His Gly Ala Asn Phe Asp Glu Ala Lys Ala Lys Ala Ile Glu

372 130 135 140

374 Leu Ser Gln Gln Gln Gly Phe Thr Trp Val Pro Pro Phe Asp His Pro

375 145 150 155 160

377 Met Val Ile Ala Gly Gln Gly Thr Leu Ala Leu Glu Leu Leu Gln Gln

378 165 170 175

380 Asp Ala His Leu Asp Arg Val Phe Val Pro Val Gly Gly Gly Gly Leu

381 180 185 190

383 Ala Ala Gly Val Ala Val Leu Ile Lys Gln Leu Met Pro Gln Ile Lys

384 195 200 205

386 Val Ile Ala Val Glu Ala Glu Asp Ser Ala Cys Leu Lys Ala Ala Leu

387 210 215 220

390 Asp Ala Gly His Pro Val Asp Leu Pro Arg Val Gly Leu Phe Ala Glu

391 225 230 235 240

393 Gly Val Ala Val Lys Arg Ile Gly Asp Glu Thr Phe Arg Leu Cys Gln

394 245 250 255

396 Glu Tyr Leu Asp Asp Ile Ile Thr Val Asp Ser Asp Ala Ile Cys Ala

397 260 265 270

399 Ala Met Lys Asp Leu Phe Glu Asp Val Arg Ala Val Ala Glu Pro Ser

400 275 280 285

402 Gly Ala Leu Ala Leu Ala Gly Met Lys Lys Tyr Ile Ala Leu His Asn

403 290 295 300

405 Ile Arg Gly Glu Arg Leu Ala His Ile Leu Ser Gly Ala Asn Val Asn

406 305 310 315 320

408 Phe His Gly Leu Arg Tyr Val Ser Glu Arg Cys Glu Leu Gly Glu Gln
409 325 330 335

411 Arg Glu Ala Leu Leu Ala Val Thr Ile Pro Glu Glu Lys Gly Ser Phe

412 340 345 350

414 Leu Lys Phe Cys Gln Leu Leu Gly Gly Arg Ser Val Thr Glu Phe Asn

415 355 360 365

417 Tyr Arg Phe Ala Asp Ala Lys Asn Ala Cys Ile Phe Val Gly Val Arg

418 370 375 380

420 Leu Ser Arg Gly Leu Glu Glu Arg Lys Glu Ile Leu Gln Met Leu Asn

421 385 390 395 400

423 Asp Gly Gly Tyr Ser Val Val Asp Leu Ser Asp Asp Glu Met Ala Lys

424 405 410 415

426 Leu His Val Arg Tyr Met Val Gly Gly Arg Pro Ser His Pro Leu Gln

427 420 425 430

429 Glu Arg Leu Tyr Ser Phe Glu Phe Pro Glu Ser Pro Gly Ala Leu Leu

430 435 440 445

432 Arg Phe Leu Asn Thr Leu Gly Thr Tyr Trp Asn Ile Ser Leu Phe His

433 450 455 460

435 Tyr Arg Ser His Gly Thr Asp Tyr Gly Arg Val Leu Ala Ala Phe Glu

436 465 470 475 480

438 Leu Gly Asp His Glu Pro Asp Phe Glu Thr Arg Leu Asn Glu Leu Gly

439 485 490 495

441 Tyr Asp Cys His Asp Glu Thr Asn Asn Pro Ala Phe Arg Phe Phe Leu

442 500 505 510

444 Ala Gly

E--> 447 <210> SEQ ID NO: 7 <211> 1545 <212> DNA <213> Escherichia coli

E--> 455 <400> SEQUENCE: 7

456 atg gct gac tcg caa ccc ctg tcc ggt gct ccg gaa ggt gcc gaa tat 48

457 Met Ala Asp Ser Gln Pro Leu Ser Gly Ala Pro Glu Gly Ala Glu Tyr

458 1 5 10 15

460 tta aga gca gtg ctg cgc gcg ccg gtt tac gag gcg gcg cag gtt acg 96

461 Leu Arg Ala Val Leu Arg Ala Pro Val Tyr Glu Ala Ala Gln Val Thr

462 20 25 30

464 ccg cta caa aaa atg gaa aaa ctg tcg tcg cgt ctt gat aac gtc att 144

465 Pro Leu Gln Lys Met Glu Lys Leu Ser Ser Arg Leu Asp Asn Val Ile

466 35 40 45

468 ctg gtg aag cgc gaa gat cgc cag cca gtg cac agc ttt aag ctg cgc 192

469 Leu Val Lys Arg Glu Asp Arg Gln Pro Val His Ser Phe Lys Leu Arg

470 50 55 60

472 ggc gca tac gcc atg atg gcg ggc ctg acg gaa gaa cag aaa gcg cac 240

473 Gly Ala Tyr Ala Met Met Ala Gly Leu Thr Glu Glu Gln Lys Ala His

474 65 70 75 80

476 ggc gtg atc act gct tct gcg ggt aac cac gcg cag ggc gtc gcg ttt 288

477 Gly Val Ile Thr Ala Ser Ala Gly Asn His Ala Gln Gly Val Ala Phe

478 85 90 95
480 tct tct gcg cgg tta ggc gtg aag gcc ctg atc gtt atg cca acc gcc 336

481 Ser Ser Ala Arg Leu Gly Val Lys Ala Leu Ile Val Met Pro Thr Ala

482 100 105 110

484 acc gcc gac atc aaa gtc gac gcg gtg cgc ggc ttc ggc ggc gaa gtg 384

485 Thr Ala Asp Ile Lys Val Asp Ala Val Arg Gly Phe Gly Gly Glu Val

486 115 120 125

488 ctg ctc cac ggc gcg aac ttt gat gaa gcg aaa gcc aaa gcg atc gaa 432

489 Leu Leu His Gly Ala Asn Phe Asp Glu Ala Lys Ala Lys Ala Ile Glu

490 130 135 140

492 ctg tca cag cag cag ggg ttc acc tgg gtg ccg ccg ttc gac cat ccg 480

493 Leu Ser Gln Gln Gln Gly Phe Thr Trp Val Pro Pro Phe Asp His Pro

494 145 150 155 160

496 atg gtg att gcc ggg caa ggc acg ctg gcg ctg gaa ctg ctc cag cag 528

497 Met Val Ile Ala Gly Gln Gly Thr Leu Ala Leu Glu Leu Leu Gln Gln

498 165 170 175

500 gac gcc cat ctc gac cgc gta ttt gtg cca gtc ggc ggc ggc ggt ctg 576

501 Asp Ala His Leu Asp Arg Val Phe Val Pro Val Gly Gly Gly Gly Leu

502 180 185 190

504 gct gct ggc gtg gcg gtg ctg atc aaa caa ctg atg ccg caa atc aaa 624

505 Ala Ala Gly Val Ala Val Leu Ile Lys Gln Leu Met Pro Gln Ile Lys

506 195 200 205

509 gtg atc gcc gta gaa gcg gaa gac tcc gcc tgc ctg aaa gca gcg ctg 672

510 Val Ile Ala Val Glu Ala Glu Asp Ser Ala Cys Leu Lys Ala Ala Leu

511 210 215 220

513 gat gcg ggt cat ccg gtt gat ctg ccg cgc gta ggg cta ttt gct gaa 720

514 Asp Ala Gly His Pro Val Asp Leu Pro Arg Val Gly Leu Phe Ala Glu

515 225 230 235 240

517 ggc gta gcg gta aaa cgc atc ggt gac gaa acc ttc cgt tta tgc cag 768

518 Gly Val Ala Val Lys Arg Ile Gly Asp Glu Thr Phe Arg Leu Cys Gln

519 245 250 255

521 gag tat ctc gac gac atc atc acc gtc gat agc gat gcg atc tgt gcg 816

522 Glu Tyr Leu Asp Asp Ile Ile Thr Val Asp Ser Asp Ala Ile Cys Ala

523 260 265 270

525 gcg atg aag gat tta ttc gaa gat gtg cgc gcg gtg gcg aaa ccc tct 864

526 Ala Met Lys Asp Leu Phe Glu Asp Val Arg Ala Val Ala Lys Pro Ser

527 275 280 285

529 ggc gcg ctg gcg ctg gcg gga atg aaa aaa tat atc gcc ctg cac aac 912

530 Gly Ala Leu Ala Leu Ala Gly Met Lys Lys Tyr Ile Ala Leu His Asn

531 290 295 300

533 att cgc ggc gaa cgg ctg gcg cat att ctt tcc ggt gcc aac gtg aac 960

534 Ile Arg Gly Glu Arg Leu Ala His Ile Leu Ser Gly Ala Asn Val Asn

535 305 310 315 320

537 ttc cac ggc ctg cgc tac gtc tca gaa cgc tgc gaa ctg ggc gaa cag 1008

538 Phe His Gly Leu Arg Tyr Val Ser Glu Arg Cys Glu Leu Gly Glu Gln

539 325 330 335

541 cgt gaa gcg ttg ttg gcg gtg acc att ccg gaa gaa aaa ggc agc ttc 1056

542 Arg Glu Ala Leu Leu Ala Val Thr Ile Pro Glu Glu Lys Gly Ser Phe

543 340 345 350

545 ctc aaa ttc tgc caa ctg ctt ggc ggg cgt tcg gtc acc gag ttc aac 1104
546 Leu Lys Phe Cys Gln Leu Leu Gly Gly Arg Ser Val Thr Glu Phe Asn

547 355 360 365

549 tac cgt ttt gcc gat gcc aaa aac gcc tgc atc ttt gtc ggt gtg cgc 1152

550 Tyr Arg Phe Ala Asp Ala Lys Asn Ala Cys Ile Phe Val Gly Val Arg

551 370 375 380

553 ctg agc cgc ggc ctc gaa gag cgc aaa gaa att ttg cag atg ctc aac 1200

554 Leu Ser Arg Gly Leu Glu Glu Arg Lys Glu Ile Leu Gln Met Leu Asn

555 385 390 395 400

557 gac ggc ggc tac agc gtg gtt gat ctc tcc gac gac gaa atg gcg aag 1248

558 Asp Gly Gly Tyr Ser Val Val Asp Leu Ser Asp Asp Glu Met Ala Lys

559 405 410 415

561 cta cac gtg cgc tat atg gtc ggc gga cgt cca tcg cat ccg ttg cag 1296

562 Leu His Val Arg Tyr Met Val Gly Gly Arg Pro Ser His Pro Leu Gln

563 420 425 430

565 gaa cgc ctc tac agc ttc gaa ttc ccg gaa tca ccg ggc gcg ctg ctg 1344

566 Glu Arg Leu Tyr Ser Phe Glu Phe Pro Glu Ser Pro Gly Ala Leu Leu

567 435 440 445

569 cgc ttc ctc aac acg ctg ggt acg tac tgg aac att tct ttg ttc cac 1392

570 Arg Phe Leu Asn Thr Leu Gly Thr Tyr Trp Asn Ile Ser Leu Phe His

571 450 455 460

574 tat cgc agc cat ggc acc gac tac ggg cgc gta ctg gcg gcg ttc gaa 1440

575 Tyr Arg Ser His Gly Thr Asp Tyr Gly Arg Val Leu Ala Ala Phe Glu

576 465 470 475 480

578 ctt ggc gac cat gaa ccg gat ttc gaa acc cgg ctg aat gag ctg ggc 1488

579 Leu Gly Asp His Glu Pro Asp Phe Glu Thr Arg Leu Asn Glu Leu Gly

580 485 490 495

582 tac gat tgc cac gac gaa acc aat aac ccg gcg ttc agg ttc ttt ttg 1536

583 Tyr Asp Cys His Asp Glu Thr Asn Asn Pro Ala Phe Arg Phe Phe Leu

584 500 505 510

586 gcg ggt tag 1545

587 Ala Gly

E--> 590 <210> SEQ ID NO: 8 <211> 514 <212> PRT <213> Escherichia coli

E--> 591 <400> SEQUENCE: 8

592 Met Ala Asp Ser Gln Pro Leu Ser Gly Ala Pro Glu Gly Ala Glu Tyr

593 1 5 10 15

595 Leu Arg Ala Val Leu Arg Ala Pro Val Tyr Glu Ala Ala Gln Val Thr

596 20 25 30

598 Pro Leu Gln Lys Met Glu Lys Leu Ser Ser Arg Leu Asp Asn Val Ile

599 35 40 45

601 Leu Val Lys Arg Glu Asp Arg Gln Pro Val His Ser Phe Lys Leu Arg

602 50 55 60

604 Gly Ala Tyr Ala Met Met Ala Gly Leu Thr Glu Glu Gln Lys Ala His

605 65 70 75 80

607 Gly Val Ile Thr Ala Ser Ala Gly Asn His Ala Gln Gly Val Ala Phe

608 85 90 95

610 Ser Ser Ala Arg Leu Gly Val Lys Ala Leu Ile Val Met Pro Thr Ala
611 100 105 110

613 Thr Ala Asp Ile Lys Val Asp Ala Val Arg Gly Phe Gly Gly Glu Val

614 115 120 125

616 Leu Leu His Gly Ala Asn Phe Asp Glu Ala Lys Ala Lys Ala Ile Glu

617 130 135 140

619 Leu Ser Gln Gln Gln Gly Phe Thr Trp Val Pro Pro Phe Asp His Pro

620 145 150 155 160

622 Met Val Ile Ala Gly Gln Gly Thr Leu Ala Leu Glu Leu Leu Gln Gln

623 165 170 175

625 Asp Ala His Leu Asp Arg Val Phe Val Pro Val Gly Gly Gly Gly Leu

626 180 185 190

629 Ala Ala Gly Val Ala Val Leu Ile Lys Gln Leu Met Pro Gln Ile Lys

630 195 200 205

632 Val Ile Ala Val Glu Ala Glu Asp Ser Ala Cys Leu Lys Ala Ala Leu

633 210 215 220

635 Asp Ala Gly His Pro Val Asp Leu Pro Arg Val Gly Leu Phe Ala Glu

636 225 230 235 240

638 Gly Val Ala Val Lys Arg Ile Gly Asp Glu Thr Phe Arg Leu Cys Gln

639 245 250 255

641 Glu Tyr Leu Asp Asp Ile Ile Thr Val Asp Ser Asp Ala Ile Cys Ala

642 260 265 270

644 Ala Met Lys Asp Leu Phe Glu Asp Val Arg Ala Val Ala Lys Pro Ser

645 275 280 285

647 Gly Ala Leu Ala Leu Ala Gly Met Lys Lys Tyr Ile Ala Leu His Asn

648 290 295 300

650 Ile Arg Gly Glu Arg Leu Ala His Ile Leu Ser Gly Ala Asn Val Asn

651 305 310 315 320

653 Phe His Gly Leu Arg Tyr Val Ser Glu Arg Cys Glu Leu Gly Glu Gln

654 325 330 335

656 Arg Glu Ala Leu Leu Ala Val Thr Ile Pro Glu Glu Lys Gly Ser Phe

657 340 345 350

659 Leu Lys Phe Cys Gln Leu Leu Gly Gly Arg Ser Val Thr Glu Phe Asn

660 355 360 365

662 Tyr Arg Phe Ala Asp Ala Lys Asn Ala Cys Ile Phe Val Gly Val Arg

663 370 375 380

665 Leu Ser Arg Gly Leu Glu Glu Arg Lys Glu Ile Leu Gln Met Leu Asn

666 385 390 395 400

668 Asp Gly Gly Tyr Ser Val Val Asp Leu Ser Asp Asp Glu Met Ala Lys

669 405 410 415

671 Leu His Val Arg Tyr Met Val Gly Gly Arg Pro Ser His Pro Leu Gln

672 420 425 430

674 Glu Arg Leu Tyr Ser Phe Glu Phe Pro Glu Ser Pro Gly Ala Leu Leu

675 435 440 445

677 Arg Phe Leu Asn Thr Leu Gly Thr Tyr Trp Asn Ile Ser Leu Phe His

678 450 455 460

680 Tyr Arg Ser His Gly Thr Asp Tyr Gly Arg Val Leu Ala Ala Phe Glu

681 465 470 475 480

683 Leu Gly Asp His Glu Pro Asp Phe Glu Thr Arg Leu Asn Glu Leu Gly

684 485 490 495



686 Tyr Asp Cys His Asp Glu Thr Asn Asn Pro Ala Phe Arg Phe Phe Leu 

687 500 505 510 
689 Ala Gly

E--> 692 <210> SEQ ID NO: 9 <211> 1548 <212> DNA <213> Escherichia coli
W--> 694 <220> FEATURE: <221> DNA <222> (1)..(1548) <223>
W--> 695 - • 
E--> 69' 

E--> 698 . * * 

B--> 698 

698 <400> SEQUENCE: 9 

699 tcgcgatctg gtactgtaag gggaaataga gatgacacac gataataaat tgcaggttga 60 
701 agctattaaa cgcggcacgg taattgacca tatccccgcc cagatcggtt ttaagctgtt 120 
703 gagtctgttc aagctgaccg aaacggatca gcgcatcacc attggtctga acctgccttc 180 
705 tggcgagatg ggccgcaaag atctgatcaa aatcgaaaat acctttttga gtgaagatca 240 

707 agtagatcaa ctggcattgt atgcgccgca agccacggtt aaccgtatcg acaactatga 300
709 agtggtgggt aaatcgcgcc caagtctgcc ggagcgcatc gacaatgtgc tggtctgccc 360 
711 gaacagcaac tgtatcagcc atgccgaacc ggtttcatcc agctttgccg tgcgaaaacg 420 
713 cgccaatgat atcgcgctca aatgcaaata ctgtgaaaaa gagttttccc ataatgtggt 480 

7X5 gc\gg'-\:r..**-.' taattgcgg*. *.ZZ*"*?**** *rfi-+?gp*:c. cctata at£ mr x i . 1* . s 

717 1 

719 act ttt tac cgc tgt aat aaa gga gaa atc atg agc aaa act atc gcg 583

720 Thr Phe Tyr Arg Cys Asn Lys Gly Glu Ile Met Ser Lys Thr Ile Ala

721 5 10 15

723 acg gaa aat gca ccg gca gct atc ggt cct tac gta cag ggc gtt gat 631

724 Thr Glu Asn Ala Pro Ala Ala Ile Gly Pro Tyr Val Gln Gly Val Asp

725 20 25 30 35

727 ctg ggc aat atg atc atc acc tcc ggt cag atc ccg gta aat ccg aaa 679

728 Leu Gly Asn Met Ile Ile Thr Ser Gly Gln Ile Pro Val Asn Pro Lys

729 40 45 50

731 acg ggc gaa gta ccg gca gac gtc gct gca cag gca cgt cag tcg ctg 727

732 Thr Gly Glu Val Pro Ala Asp Val Ala Ala Gln Ala Arg Gln Ser Leu

733 55 60 65

735 gat aac gta aaa gcg atc gtc gaa gcc gct ggc ctg aaa gtg ggc gac 775

736 Asp Asn Val Lys Ala Ile Val Glu Ala Ala Gly Leu Lys Val Gly Asp

737 70 75 80

739 atc gtt aaa act acc gtg ttt gta aaa gat ctg aac gac ttc gca acc 823

740 Ile Val Lys Thr Thr Val Phe Val Lys Asp Leu Asn Asp Phe Ala Thr

741 85 90 95

743 gta aac gcc act tac gaa gcc ttc ttc acc gaa cac aac gcc acc ttc 871

744 Val Asn Ala Thr Tyr Glu Ala Phe Phe Thr Glu His Asn Ala Thr Phe

745 100 105 110 115

747 ccg gca cgt tct tgc gtt gaa gtt gcc cgt ctg ccg aaa gac gtg aag 919

748 Pro Ala Arg Ser Cys Val Glu Val Ala Arg Leu Pro Lys Asp Val Lys

749 120 125 130

752 att gag atc gaa gcg atc gct gtt cgt cgc taa tcttgatgga aatccgggct 972

753 Ile Glu Ile Glu Ala Ile Ala Val Arg Arg

754 135 140

756 atcatgcccg gattaagtct gatgacaaac gcaaaatcgc ctgatgcgct acgcttatca 1032

758 ggcctacgtg attcctgcaa tttattgaat ttgttggccg gataaggcat ttacgccgca 1092 

760 tccggcatga acaaaactca ctttgtctac aatctgaatc ggggctatcg tgcccagttt 1152 

762 attctttatt gccagccgta acgacggcta tagaaccctt tcaccaactg ggttaatgtc 1212 

764 atataccctg ccagaatcgc aaccagccac gggaaatagc ttaacggcag cgcctgtaat 1272 

766 tgcagataac tggccagcgg tgaaaacggc aatgcgatcc cgacaatcat cacgatcacg 1332 

768 gtcatgatca ttaacggcca cgatgcacag ctctgaataa acggcacacg gcgggtgcgg 1392 

770 atcatatgca caatcagcgt ttgcgacagt aagcccacca caaaccatcc cgactggaac 1452 

772 agcgtttgcg tttccggcgt gttggcatgg aatacccacc acatcaggca aaacgtcaaa 1512 

774 atatcgaaga tcgagctgafc cggtccg^ag aagatp 1548 

E--> 777 <210> SEQ ID NO: 10<211> 141<212> PRT<213> Escherichia coli

E--> 778

E--> 778

E--> 778

778 <400> SEQUENCE: 10

779 Met Ser Gln Thr Phe Tyr Arg Cys Asn Lys Gly Glu Ile Met Ser Lys

780 1 5 10 15

782 Thr Ile Ala Thr Glu Asn Ala Pro Ala Ala Ile Gly Pro Tyr Val Gln

783 20 25 30

785 Gly Val Asp Leu Gly Asn Met Ile Ile Thr Ser Gly Gln Ile Pro Val

786 35 40 45

788 Asn Pro Lys Thr Gly Glu Val Pro Ala Asp Val Ala Ala Gln Ala Arg

789 50 55 60

791 Gln Ser Leu Asp Asn Val Lys Ala Ile Val Glu Ala Ala Gly Leu Lys

792 65 70 75 80

794 Val Gly Asp Ile Val Lys Thr Thr Val Phe Val Lys Asp Leu Asn Asp

795 85 90 95

797 Phe Ala Thr Val Asn Ala Thr Tyr Glu Ala Phe Phe Thr Glu His Asn

798 100 105 110

800 Ala Thr Phe Pro Ala Arg Ser Cys Val Glu Val Ala Arg Leu Pro Lys

801 115 120 125

803 Asp Val Lys Ile Glu Ile Glu Ala Ile Ala Val Arg Arg

804 130 135 140

VERIFICATION SUMMARY DATE: 02/28/2006

PATENT APPLICATION: US/10/567,749 TIME: 12:03:11 

Input Set : A:\pto.da.txt 
Output Set: N:\CRF4\02282006\J567749.raw 

L:4 M:270 C: Current Application Number differs, Replaced Current Application No 
L:4 M:271 C: Current Filing Date differs, Replaced Current Filing Date
L:0 M:282 E: Numeric Field Identifier Missing, <120> TITLE INVENTION 
L:0 M:201 W: Mandatory field data missing, <130> FILE REFERENCE 
L:4 M:283 W: Missing Blank Line separator, <160> field identifier 
L:5 M:281 W: Numeric Fields not Ordered, <220> not ordered!
L:5 M:283 W: Missing Blank Line separator, <220> field identifier 
L:5 M:256 W: Invalid Numeric Header Field, <220> has non-blank data
L:7 M:282 E: Numeric Field Identifier Missing, <210> is required. 
L:7 M:282 E: Numeric Field Identifier Missing, <211> is required. 
L:7 M:282 E: Numeric Field Identifier Missing, <212> is required.
L:7 M:282 E: Numeric Field Identifier Missing, <213> is required. 

L:7 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQUENCE ID NOS:0 differs :1 
L:92 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQ ID NO: 2<211> 330<212>
PRT<213> Escherichia coli<400> 2 

L:93 M:282 E: Numeric Field Identifier Missing, <211> is required. 
L:93 M:282 E: Numeric Field Identifier Missing, <212> is required. 
L:93 M:282 E: Numeric Field Identifier Missing, <213> is required.

L:93 M:200 E: Mandatory Header Field missing, <400> is required.
£,: 1ST wRJXZ"ST i'SWj r^vaTiiT or" duplicate 3equencier-ru Numoer-, S3Q 'It- 'tiCn~2<7LT> y*?<2Xr>" h r''rr:t^ 
DNA<213> Escherichia coli<220><221> Allele<222> 

L:160 M:256 W: Invalid Numeric Header Field, <220> has non-blank data
L:162 M:282 E: Numeric Field Identifier Missing, <211> is required.
L:162 M:282 E: Numeric Field Identifier Missing, <212> is required. 
L:162 M:282 E: Numeric Field Identifier Missing, <213> is required. 

L:197 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQ ID NO: 4<211> 75<212> 
DNA<213> Escherichia coli<220><221> tRNA<222> 

L:200 M:282 E: Numeric Field Identifier Missing, <211> is required. 
L:200 M:282 E: Numeric Field Identifier Missing, <212> is required. 
L:200 M:282 E: Numeric Field Identifier Missing, <213> is required. 
L:200 M:200 E: Mandatory Header Field missing, <220> Tag not found for SEQ ID#:4 
L:207 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQ ID NO: 5<211> 1545<212>
DNA<213> Escherichia coli<220><221> CDS<222> 

L:210 M:282 E: Numeric Field Identifier Missing, <211> is required. 
L:210 M:282 E: Numeric Field Identifier Missing, <212> is required. 
L:210 M:282 E: Numeric Field Identifier Missing, <213> is required. 
L:210 M:200 E: Mandatory Header Field missing, <220> Tag not found for SEQ ID#:5
L:345 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQ ID NO: 6<211> 514<212> 
PRT<213> Escherichia coli 

L:346 M:282 E: Numeric Field Identifier Missing, <211> is required. 
L:346 M:282 E: Numeric Field Identifier Missing, <212> is required. 
L:346 M:282 E: Numeric Field Identifier Missing, <213> is required. 
L:346 M:283 W: Missing Blank Line separator, <400> field identifier 

L:447 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQ ID NO: 7<211> 1545<212>
DNA<213> Escherichia coli 

L:449 M:256 W: Invalid Numeric Header Field, <220> has non-blank data 
L:452 M:256 W: Invalid Numeric Header Field, <220> has non-blank data 
L:455 M:282 E: Numeric Field Identifier Missing, <211> is required. 
L:455 M:282 E: Numeric Field Identifier Missing, <212> is required. 
L:455 M:282 E: Numeric Field Identifier Missing, <213> is required. 

L:590 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQ ID NO: 8<211> 514<212>
PRT<213> Escherichia coli 

L:591 M:282 E: Numeric Field Identifier Missing, <211> is required. 
L:591 M:282 E: Numeric Field Identifier Missing, <212> is required.
L:591 M:282 E: Numeric Field Identifier Missing, <213> is required.
L:591 M:283 W: Missing Blank Line separator, <400> field identifier 

L:692 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQ ID NO: 9<211> 1548<212> 
DNA<213> Escherichia coli 

L:694 M:256 W: Invalid Numeric Header Field, <220> has non-blank data 
L:695 M:283 W: Missing Blank Line separator, <220> field identifier 
L:695 M:256 W: Invalid Numeric Header Field, <220> has non-blank data 
L:698 M:282 E: Numeric Field Identifier Missing, <211> is required. 
L:698 M:282 E: Numeric Field Identifier Missing, <212> is required. 
L:698 M:282 E: Numeric Field Identifier Missing, <213> is required. 

L:777 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQ ID NO: 10<211> 141<212> 
PRT<213> Escherichia coli 

L:778 M:282 E: Numeric Field Identifier Missing, <211> is required. 
L:778 M:282 E: Numeric Field Identifier Missing, <212> is required. 
L:778 M:282 E: Numeric Field Identifier Missing, <213> is required. 
L:778 M:283 W: Missing Blank Line separator, <400> field identifier 

L:4 M:203 E: No. of Seq. differs, <160> Number Of Sequences : Input (10) Counted (9)